The Atari Grand Challenge Dataset

نویسندگان

  • Vitaly Kurin
  • Sebastian Nowozin
  • Katja Hofmann
  • Lucas Beyer
  • Bastian Leibe
چکیده

Recent progress in Reinforcement Learning (RL), fueled by its combination, with Deep Learning has enabled impressive results in learning to interact with complex virtual environments, yet real-world applications of RL are still scarce. A key limitation is data efficiency, with current state-of-the-art approaches requiring millions of training samples. A promising way to tackle this problem is to augment RL with learning from human demonstrations. However, human demonstration data is not yet readily available. This hinders progress in this direction. The present work addresses this problem as follows. We (i) collect and describe a large dataset of human Atari 2600 replays – the largest and most diverse such data set publicly released to date (ii) illustrate an example use of this dataset by analyzing the relation between demonstration quality and imitation learning performance, and (iii) outline possible research directions that are opened up by our work.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Quality-driven and real-time iris recognition from close-up eye videos

This paper deals with the computation of robust iris templates from video sequences. The main contribution is to propose (i) optimal tracking and robust detection of the pupil, (ii) smart selection of iris images to be enrolled, and (iii) multi-thread and quality-driven decomposition of tasks to reach real-time processing. The evaluation of the system was done on the Multiple Biometric Grand Ch...

متن کامل

General Video Game Playing

One of the grand challenges of AI is to create general intelligence: an agent that can excel at many tasks, not just one. In the area of games, this has given rise to the challenge of General Game Playing (GGP). In GGP, the game (typically a turn-taking board game) is defined declaratively in terms of the logic of the game (what happens when a move is made, how the scoring system works, how the...

متن کامل

Robust local features for remote face recognition

Article history: Received 25 October 2015 Received in revised form 28 March 2017 Accepted 13 May 2017 Available online 31 May 2017 In this paper, we propose a robust local descriptor for face recognition. It consists of two components, one based on a shearlet-decomposition and the other on local binary pattern (LBP). Shearlets can completely analyze the singular structures of piecewise smooth i...

متن کامل

A Grand Convergence in Mortality is Possible: Comment on Global Health 2035

The grand challenge in global health is the inequality in mortality and life expectancy between countries and within countries. According to Global Health 2035, the Lancet Commission celebrating the 20th anniversary of the World Development Report (WDR) of 1993, the world now has the unique opportunity to achieve a grand convergence in global mortality within a generation. This article comments...

متن کامل

Efficient Iris Identification with Improved Segmentation Techniques

In this chapter, the authors propose and implement an improved iris recognition method based on image enhancement and heuristics. They make major improvements in the iris segmentation phase. In particular, the authors implement the raised to power operation for more accurate detection of the pupil region. Additionally, with their technique they are able to considerably reduce the candidate limb...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1705.10998  شماره 

صفحات  -

تاریخ انتشار 2017